Adaptive Critic Designs - Neural Networks, IEEE Transactions on

Author

Danil V. Prokhorov

Abstract

We discuss a variety of adaptive critic designs (ACD’s) for neurocontrol. These are suitable for learning in noisy, nonlinear, and nonstationary environments. They have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Our discussion of these origins leads to an explanation of three design families: Heuristic dynamic programming (HDP), dual heuristic programming (DHP), and globalized dual heuristic programming (GDHP). The main emphasis is on DHP and GDHP as advanced ACD’s. We suggest two new modifications of the original GDHP design that are currently the only working implementations of GDHP. They promise to be useful for many engineering applications in the areas of optimization and optimal control. Based on one of these modifications, we present a unified approach to all ACD’s. This leads to a generalized training procedure for ACD’s.

Similar resources

Adaptive Critic Learning Techniques for Engine Torque and Air-Fuel Ratio Control

A new approach for engine calibration and control is proposed. In this paper, we present our research results on the implementation of adaptive critic designs for self-learning control of automotive engines. A class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming is used in this research project. The goals of the present learning ...

An ART-based fuzzy adaptive learning control network

This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON), constructed by integrating two fuzzy adaptive learning control networks (FALCON), each of which has a feedforward multilayer network and is developed for the realization of a fuzzy controller. One FALCON performs as a critic network (fuzzy predictor), the other as an action network (fuzzy controller). Using t...

Neurocontroller alternatives for "fuzzy" ball-and-beam systems with nonuniform nonlinear friction

The ball-and-beam problem is a benchmark for testing control algorithms. In the World Congress on Neural Networks, 1994, Prof. L. Zadeh proposed a twist to the problem, which, he suggested, would require a fuzzy logic controller. This experiment uses a beam, partially covered with a sticky substance, increasing the difficulty of predicting the ball's motion. We complicated this problem even mor...

A deployed engineering design retrieval system using neural networks

We describe a neural information retrieval system (NIRS), now in production within the Boeing Company, which has been developed for the identification and retrieval of engineering designs. Two-dimensional and three-dimensional representations of engineering designs are input to adaptive resonance theory (ART-1) neural networks to produce clusters of similar parts. The trained networks are then ...

Intelligent supply chain management using adaptive critic learning

A set of neural networks is employed to develop control policies that are better than fixed, theoretically optimal policies, when applied to a combined physical inventory and distribution system in a nonstationary demand environment. Specifically, we show that model-based adaptive critic approximate dynamic programming techniques can be used with systems characterized by discrete valued states ...

Publication date: 1998
